Volume 1 14, Number 2, March-April 2009 

Journal of Research of the National Institute of Standards and Technology 

[J. Res. Natl. Inst. Stand. Technol. 114, 83-97 (2009)] 

Issues in Optical Diffraction 
Theory 



Volume 114 



Number 2 



March-April 2009 



Klaus D. Mielenz 

National Institute of Standards 
and Technology (ret.) 
Gaithersburg, MD 20899-8440 

kmielenz(2hvildblue.net 



This paper focuses on unresolved or 
poorly documented issues pertaining to 
Fresnel's scalar diffraction theory and its 
modifications. In Sec. 2 it is pointed out 
that all thermal sources used in practice 
are finite in size and errors can result from 
insufficient coherence of the optical field. 
A quarter-wave criterion is applied to 
show how such errors can be avoided by 
placing the source at a large distance from 
the aperture plane, and it is found that in 
many cases it may be necessary to use 
collimated light as on the source side 
of a Fraunhofer experiment. If these 
precautions are not taken the theory of 
partial coherence may have to be used for 
the computations. 

In Sec. 3 it is recalled that for near-zone 
computations the Kirchhoff or Rayleigh- 
Sommerfeld integrals are applicable, but 
fail to correctly describe the energy flux 
across the aperture plane because they are 
not continuously differentiable with 
respect to the assumed geometrical field 
on the source side. This is remedied by 
formulating an improved theory in which 
the field on either side of a semi-reflecting 
screen is expressed as the superposition 
of mutually incoherent components 
which propagate in the opposite directions 
of the incident and reflected light. 
These components are defined as linear 
combinations of the Rayleigh-Sommerfeld 
integrals, so that they are rigorous 
solutions of the wave equation as well as 
continuously differentiable in the aperture 
plane. Algorithms for using the new theory 
for computing the diffraction patterns of 
circular apertures and slits at arbitrary 



distances z from either side of the aperture 
(down to z = ± 0.0003 A) are presented, 
and numerical examples of the results are 
given. These results show that the 
incident geometrical field is modulated by 
diffraction before it reaches the aperture 
plane while the reflected field is spilled 
into the dark space. At distances from the 
aperture which are large compared to the 
wavelength A these field expressions are 
reduced to the usual ones specified by 
Fresnel's theory. In the specific case of a 
diffracting half plane the numerical results 
obtained were practically the same as 
those given by Sommerfeld's rigorous 
theory. 

The modified theory developed in this 
paper is based on the explicit assumption 
that the scalar theory of light cannot 
explain plolarization effects. This premise 
is justified in Sec. 4, where it is shown 
that previous attempts to do so have 
produced dubious results. 
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1. Introduction 

Calculations pertaining to the diffraction of light by 
an aperture are typically based on geometric assump- 
tions as illustrated in Fig. 1 . A plane screen JS contain- 
ing an aperture osf of width 2w is illuminated by a 



quasi-monochromatic source <^"of width 2s and 
circular wave number k = 2n/X, P is a source point, 
Q is a point inside the aperture, and P is the point at 
which diffraction is observed. A point inside the 
aperture (usually the center) is chosen as the origin O, 
and cartesian or polar coordinates are used so that 
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p o=Oo>.>V z o) =r o( cos 0o sin e o> sin 0o sin e o> cos e o>> 
P=(x, j,z)=r(cos</>sin #,sin 0sin 0,cos 0). 




Fig. 1. Geometry pertaining to Fresnel's diffraction integral. 

The first mathematical theory of diffraction was 
derived in Fresnel's 1819 Memoir on the Diffraction 
of Light [1] from a surprisingly minimal set of assump- 
tions: 

Monochromatic light is a harmonic wave motion that 
can be described in terms of a scalar wave function, 
[/(P)exp(- icot), where |C/(P)| 2 is the irradiance at P, 
(0= like, is the circular frequency k = 2%/X, is the 
circular wave number and c is the speed of light. 



The summation of Huy gens' wavelets is best carried 
out on a wave front that coincides with the aperture; 
otherwise the computations will be too complicated. 

A plane aperture is assumed, which implies that the 
source must be distant as well as paraxial (r > 2w 2 /X, 
cos ~ 0), 1 so that the incident wave front can, with- 
in acceptable tolerances, be assumed to lie inside the 
aperture. 

The field on the source side of the screen is not 
affected by the presence of the screen (St. Venant's 
hypothesis). 2 For an infinitesimally small source 
with radiant intensity / which is located at the point 
P in Fig. 1, this leads to t/(Q) = V7exp fikP Q)/P Q. 

The Huygens' wavelets originating at Q are 
anisotropic spherical waves that oscillate a quarter 
period ahead of the incident field and whose ampli- 
tudes are scaled by the factor \IX. As the correspon- 
ding effect at the point of observation P is attri- 
butable to the central Fresnel zone at Q acting alone, 
there is no need to know the nature of this aniso- 
tropy and hence it follows that dC/(P) = (i/X)U(Q) 
expfikQP)/QP. 

On account of the denominator XQP in the last 
expression it is necessary to assume that QP is a large 
distance (r » X), and that P must also be paraxial 
(cos#= 1) as the direction of observation cannot be 
substantially different from the direction of inci- 
dence. 

Combining these expression, Fresnel found 

C/ F (P) = ^fdQ eXp[i ^ e + e/,)] 

A J 



* L ~ p oQQ p 



(2) 



iV^geom exp[ik(r +r)] 



J c/Qexp[iM(0], z»X, 



e£/ 



where E gmm = I/r t } is the incident geometrical irradi- 
ance of the aperture plane and A(g) is the path differ- 
ence (P Q + QP) - (r„ + r). 

Although a relic of the early 19 th century, the Fresnel 
diffraction integral (2) is still used today in its original 



This inequality for r follows from Eq. (4a) for s = 0. 

Jean Claude Saint-Venant (1797-1866) taught mathematics at the 
Ecole des Ponts et Chaussees in Paris, where Fresnel studied engi- 
neering between 1806 and 1809. 
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form and has remained a most useful, reliable tool for 
diffraction calculations that have consistently yielded 
results which agree with experience and are well docu- 
mented [2]. Accordingly, it is commonly regarded as a 
cornerstone of diffraction theory and its validity for mid- 
field applications remains unchallenged. On the other 
hand, the classical theory of optical diffraction is limited 
in scope as well as physical significance by several 
issues which are still unresolved, misunderstood, or 
poorly documented. For example: 

The assumption of an infinitesimally small isotropic 
point source is seldom justified. All thermal sources 
are finite in size, so that errors may arise from an 
insufficient coherence of the optical field. As shown in 
Sec. 2, below, this can be avoided by invoking the 
theory of partial coherence, or by negating the error in 
the first place by designing a diffraction experiment so 
that the aperture illumination is "almost" coherent. 

As Fresnel's integral is not applicable for computa- 
tions in the near zone, it would seem that in this 
case the more accurate Rayleigh-Sommerfeld or 
Kirchhoff boundary-value integrals [Eqs. (5a-c), 
below] can be used with confidence. This is, however, 
not the case as these integrals do not correctly describe 
the field in the proximity of the aperture screen. In 
Sec. 3, this problem is solved by constructing an 
improved theory in which St. Venant's hypothesis is 
abandoned and the field on both sides of the screen is 
expressed in terms of linear combinations of the 
Rayleigh-Sommerfeld integrals. 

The standard solutions of the Fresnel diffraction inte- 
gral (2) for circular apertures and apertures bounded 
by straight edges are models of mathematical ele- 
gance, but notoriously difficult to evaluate in practice. 
The numerical methods used in this paper not as ele- 
gant, but easier to use. As they are rigorously 
correct, they can be used at arbitrary distances from 
the screen, and this is recommended. 

Initially, Fresnel believed that light is akin to longitu- 
dinal sound waves but prior to the completion of his 
theory he discovered, in collaboration with Arago, 3 
that it is a transverse wave. He pondered the obvious 
question how this discovery affected the theory of 
diffraction and concluded that "the arguments and 



At the time of this collaboration with Fresnel, Francois Jean 
Dominique Arago (1786-1853) was Secretary of the Paris 
Observatory. For a brief time in the Spring of 1848 he served as the 
Head of the Second French Republic. 



computations contained in the Memoir harmonize quite 
as well with this new hypothesis as with the preceding, 
because they are independent of the actual directions of 
the vibrations." hi other words, the scalar approach 
makes it impossible to describe the diffraction of 
polarized light. This is confirmed in Sec. 4, below. 

2. Coherence Issues 

Extended thermal sources employed in diffraction 
experiments are used either by themselves or followed 
by a limiting aperture. In either case, the diffraction pat- 
tern due to the principal aperture osf can be evaluated 
using a generalized Fresnel integral derived by this 
author from the general equations for the propagation of 
cross-spectral density in a partially coherent optical field 
[3]. The case of an incoherent source used by itself is a 
straightforward generalization of Eq. (2) and leads to the 
following expression for the irradiance at the point P in 
Fig. 1, 



E(?)= 



—^j dP Z(P ) | J* dQ exp[ ik( P Qt QP)] | ,(3a) 



where L (P ) is the source radiance. In the case of a 
thermal source followed by an aperture located in the 
plane <S*, it is necessary to use the van Cittert-Zernike 
theorem to calculate the cross-spectral density W(P , P , co) 
on d?and then find E(P) from the expression 



E(P) = 



MdPodP c ' 

r i 



^(P ,P ,<9) 



mm 

j dQdQ' exp[iA;(/> Q+e/>- P Q<+ Q P], 



(3b) 



• s" 



where P and P ' are two points on S 9 , Q and Q' are 
points in osf, and (O is the circular frequency of the light. 
These equations were applied in Ref. [4] to the specific 
case of concentric circular apertures and yielded compli- 
cated but closed solutions in terms of Lommel functions. 
The use of Eqs. (3a) and (3b) for practical computa- 
tions is a very tedious task. Therefore, it may be desir- 
able to avoid the need for these computations in the first 
place by designing diffraction experiments so that the 
aperture illumination will be "almost" coherent and the 
Fresnel integral (2) can still be used, in spite of the finite 
size of a given source. This can be done as follows, 
assuming the worst case of a totally incoherent source as 
implied by Eq. (3 a), considering a concentric arrange- 
ment of source and aperture as in Fig. 1, and then 
applying the quarter-wave criterion so that the 
path lengths P Q do not vary by more than A/4. 
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According to Fig. 1, the extreme values of P Q are 

r and ^r^ +{s+ w) 2 ~ r + (s+ w) 2 /2r , and thus the 
desired criterion is (s + w) 2 /r < XI 2 or 



'0 



> 2(s + wf 



(4a) 



As shown in Table 1, the corresponding minimum 
distances r 0mfa vary significantly with the sizes of 
source and aperture, ranging from a few millimeters for 
small sources and apertures to hundreds of meters for 
large ones. In the latter case Eq. (4a) can only be 
satisfied by placing the source in the focal plane of a 
collimator lens, as on the source side of a Fraunhofer 
diffraction experiment. 4 



Table 1. Smallest permissible source-aperture distance r 0mil defined 
by Eq. (3a) for selected aperture sizes 2w (left column) and source 
sizes 2s (top row), A = 1 |^m 



2w/2s 


0.01 mm 


0.1 mm 


1 mm 


10 mm 


0.1 mm 


6 mm 


20 mm 


605 mm 


50.1m 


1 mm 


510 mm 


605 mm 


2m 


60.5 m 


10 mm 


50 m 


51 m 


60.5 m 


200 m 



The coherence criterion (4a) assures a coherent and 
uniform aperture field so that the path lengths 
( p oQ + QP) are all effectively equal to (r + QP), as if 
the incident field is a plane wave propagating in the 



direction of the z-axis and U geom = JE geom exp(ikz). 

Under these conditions Eq. (3 a) is reduced to a product 
of independent integrals over source and aperture, 

and it is easy to show that the first of these, dP L{ P ) 

is equal to the geometrical irradiance of the aperture 
plane if Lambert's law is assumed. That is, 



E(P) = — [ — J" dP Z(P ) x | J dQ exp[i K r + QP)] 



(Xr) 
p 

geom 

~{Xrf 



aal 



|JdQexp(i^P)| 2 , 



(4b) 



&i 



which is the squared modulus of Eq. (2) for P Q = r 



3. Rigorous Theory 
3.1 Background 

Fresnel was aware that the spherical wavelets 
assumed in his derivation of Eq. (2) cannot be iso- 
tropic, because otherwise light would also travel back 
toward the source. To avoid this contradiction, so-called 
inclination factors can be introduced to assure that the 
amplitudes of the wavelets are zero in the reverse direc- 
tion. As mentioned earlier, Fresnel did not know the form 
of these factors and simply omitted them, assuming 
correctly that they are not needed in a paraxial theory. 

The question of inclination factors was answered by 
the publication of Helmholtz' theorem in 1859 [5]. 5 
Whereas Fresnel had only stipulated that the screen S 
in Fig. 1 must be large enough to prevent the leakage of 
light around its edges, Helmholtz imagined it to be an 
infinitely large, closed surface which does not contain 
the primary source, and then invoked Green's formula to 
express the diffracted field C/(P) as a surface integral of 
the form prescribed by Fresnel's first assumption. This 
provided the missing inclination factors in the form of 
the normal derivatives dU(Q)/dn, which vanish when the 
light propagates in the direction tangential to the screen. 
In short, Helmholtz' theorem states that the diffracted 
field is confined to the inside of the surface &" but null 
on the outside, and thus it merges Huygens' principle and 
St. Venant's hypothesis into one. 

The principal solutions of Helmholtz' theorem are 
the familiar Rayleigh-Sommerfeld and Kirchhoff 
diffraction integrals [6,7,8], 



^(P): 



In 



\ dQ dU ^ 



(Q) exp(ikQP) 



'■*' 



ik^ 



geom 



2n 



JdQ 



dz QP 

exp(ikQP) 



(5a) 






QP 



z>0, 



^(P): 



J_ 

In 



[dou ( q)A(£M^2) 



'% 



QP 



1 W$ 

z>0, 



ik dz 



£/k(P) = }[^ ) (P) + £4 ) (P)]. 



(5b) 



(5c) 



Equation (4a) is more stringent than a well-known coherence 
criterion attributed to Verdet (1869) which specifies different mini- 
mum distances, r 0fAl = Ssw/h. These are also quite large for large 
apertures and sources. 



where the right-most expressions in (5a,b) were 
obtained by assuming that the coherence criterion (4a) 



Helmholtz' theorem was formulated as a mathematical theory of 
organ pipes, and thus also implies the assumption of longitudinal 
waves. 



86 



Volume 1 14, Number 2, March-April 2009 

Journal of Research of the National Institute of Standards and Technology 



is satisfied so that jE geom exp(ikz) could be substituted 
for C/ g eom(Q). It is well known and easy to show that in 
the mid zone these expressions are all reduced to the 
Fresnel integral (2) so that 

U$(P) = U&(P) = U K (P) = U 1 ,(P), *»A. (5d) 

It is equally well known that the Rayleigh-Sommerfeld 
integrals reproduce the boundary values assumed in 
their derivation, 



a^(Q) 



•■*& 



Teom 



(5e) 



^(Q)-V^ 



'RS< 



geom 



Qe 



but not the corresponding values, t/^g{Q) = jE geom 

and 3 £/£g (Q) / dz = ikJE geom . Therefore, the wave func- 
tions defined by Eqs. (5a-c) are different from one 
another in the near zone and not continuously differen- 
tiable in the aperture, so that none expresses the 
diffracted field as an analytical continuation of the 
assumed geometrical field on the source side. 

In order to overcome this failure of the classical 
boundary-value theories to describe a smooth flow of 
energy across the aperture plane, so-called "rigorous" 
theories have been formulated in which St. Venant's 
hypothesis is abandoned and it is assumed that the 
incident geometrical is modified by diffraction before it 
reaches the screen. Hence, its values on the source side 
can be determined by postulating that the overall field 
is continuously differentiable inside the aperture. The 
most important, and by far most successful treatment of 
this type is Sommerfeld's rigorous theory of diffraction 
by a half plane [7, 9], which is expressed in closed form 
and involves no approximations of any sort. Other 
examples are the Rayleigh-Bouwkamp [6, 10] and 
Levine-Schwinger [11] theories, which were intended 
to define the transmission coefficients of very small 
apertures but are approximations based on assumed 
aperture field distributions expressed as a series of 
algebraic functions. Undoubtedly, the rigorous treat- 
ment of diffraction problems is more powerful than the 
boundary- value approach but unfortunately some of the 
most common problems, such as diffraction by circular 
apertures or slits of arbitrary sizes, have so far not been 
solved rigorously. 

The reasons for this deficiency of the prior literature 
were analyzed in a recent study [12] which included a 



numerical comparison of the respective half-plane 
results obtained from the Sommerfeld and Rayleigh- 
Sommerfeld theories. The theory derived in the follow- 
ing subsection is based on the findings of this study, 
and it is important that it involves concepts not found in 
other theories: 

The frequently encountered association of the 
Rayleigh-Sommerfeld integrals with parallel or per- 
pendicularly polarized light is abandoned and noth- 
ing is assumed about them, except that they are in- 
dependent solutions of the wave equation which 
satisfy the respective boundary conditions specified 
in Eq. (5e), above. 

Similarly, the customary distinction between "black" 
and "metallic" screens is abandoned. The physical 
nature of the screen is not specified, except that it has 
a certain reflectance | p | 2 which can be used as a 
scale factor to describe different types of screens. 

In the general case of a partially reflecting screen 
there will be two mutually incoherent diffraction 
patterns, one in the forward direction of the incident 
light and the other in the reverse direction of the 
reflected light. The latter is usually not observable, 
but must be taken into account because its presence 
affects the radiant flux transmitted in the forward 
direction. 

3.2 Derivations 

Consider the diffraction of light by a plane aper- 
ture o$f in an infinite, infinitesimally thin screen J? 
which is illuminated by a source <§P, as shown in Fig. 1. 
Assume that 12 is a specular reflector with amplitude 
reflectance p and that the coherence condition (4a) is 
satisfied, so that the incident geometric field and its 
reflection can be expressed in the form of plane waves, 

^geom =V jB geom exp(ifc) and U geom = p^JE geom exp(-ifc). 

Under these conditions, the diffracted field will like- 
wise be composed of mutually incoherent components 

U(V) and U(V), which travel in opposite directions. The 
task at hand is to define these components so that each 
is continuously differentiable inside the aperture. 



Using a normalized notation so that L(P) = Ji? geom w(P) 
and £/(P) = jE geom w(P), we express w(P) and w(P) 
in the form of the trial solutions, 
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,J 



«(P) 
«(P) = p 



u+(P), z>0 



[ exp(ife) + w_ (P) , z < 
m+(P), z>0 

W_ (P) , Z < 



(6a) 



where w+(P) and iJ ± (P) are initially unknown field 
components which are attributed to diffraction effects. 
As these unknown quantities must have certain 
prerequisite properties such as obeying the wave equa- 
tion as well as the infinity and edge conditions [7, 13], 
and because the Rayleigh-Sommerfeld integrals (5a, b) 
have these properties, m+(P) and w+(P) can be defined 
as linear combinations of the form 



«±(P) : 
«±(P) : 



:flII^(P)±6«^(P), 

■■au$(P)±bu<$P), 



(6b) 



where P = (x, y, -z) is the reflection of the point of 
observation P, definec 
expression when z < 0. 



observation P, defined so that Mj^ (P) is a valid 



The coefficients a, b and a, b in (6b) can now be 
determined by postulating that the forward field is 
equal to Fresnel's integral m f (P) in the positive mid 
zone and equal to the incident geometrical field in 
the negative mid zone. The respective values for the 
reverse field are assumed to be zero and £/ F (P). That is, 



«(P)= 



(a+6)« F (P)=M F (P), 



»A 



exp(ifc)+(a - b) w F (P) = exp(i kz), - z »A 



(6c) 



m=p\ (B ; b)M?)=0 > z>>x (6d) 

(a-6)w F (P) = M F (P), -z»A 



where Eq. (5d) was used to let u^ = u^ = u Y when z/X 
is large. 

Equations (6c,d) are satisfied if a=b=^, a = -b=-j, 
so that 



II (P): 



«(P) = P 



«k(P), z>0 
exp(ife) + m k (P) , z<0 
w K (P), z>0 
« K (P), z<0 



(6e) 



where 



,(p)( 



,("k 



«k(P) = -[«^(P) + «S(P)]. 



,(rt| 



■/Or 



(6f) 



"K(P) = -[<s ; ( p )-<s( p )]- 



It remains to show that these expressions are contin- 
uously differentiable inside the aperture. In the case of 
u (P), this will be the case if 



»i 



M)< 



-[«^(0 + <(Q)] = 

i+t[»m(Q)-"m(Q)]. 



(6g) 



\_d_ 

2 dz 



[4 p s ) (Q)+"rs ) (Q)] = 



\k + 



1 



,(p) 



2 3(-z) 



[<s ; (Q)-<s(Q)]> 



(p)r 



(6h) 



which according to Eq. (5e) is true. This argument can 
be repeated to prove that, likewise, w(P) is continuous- 
ly differentiable when P = Q. 

Equations (6e) represent the main result of this 
Section. The corresponding forward and reverse irradi- 
ances of the field are 



£(P)/£ geom =|«(P)| 2 = 



£(P)/£ geom =|«(P)| 



I«k(P)I 



z>0 



exp(iAz) + m k (P) , z < 



(6i) 



l« K (P)l 
l%(P)l 



z>0 



z<0 



(6j) 



Like m(P) and m(P) themselves, these expressions are 
continuously differentiable inside the aperture and thus 
imply a smooth, bidirectional flow of energy from one 
side to the other. It should be noted that, in these 
expressions, the roles of Kirchhoff's integral m k (P) and 
its counterpart m k (P) are reversed on opposite sides 
of the screen, so that they counterbalance each other 
and the discontinuities of the Rayleigh-Sommerfeld 

integrals w^g (P) are eliminated. 

The general properties of the diffracted field defined 
by these quantities can be inferred from the above- 
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mentioned fact that the differences between the 
Rayleigh-Sommerfeld integrals are pronounced only in 
the immediate proximity of the screen and vanish in the 
mid zone. Thus the forward component w K (P) in the 
lower Eq. (6i) disappears as the Fresnel limit is 
approached, and the reverse component m k (P) in the 
upper equation (6j) is an evanescent wave which also 
vanishes in the Fresnel limit. For \z\ »X, the forward 
field is the same as in Fresnel's theory and the reverse 
field is a mirror image of the forward Fresnel pattern. 

3.3 Implementation 
3.3.1 General 

The formulae of the preceding section are unsuited 
for numerical computations unless they can be reduced 
to single integrals that can be evaluated without simpli- 
fying assumptions which might degrade their accuracy. 
Fortunately, the rare instances in which this can be done 
include two cases of great practical importance: circu- 
lar apertures and apertures bounded by straight edges 
when illuminated by normally incident light. The single 
integrals obtained in these two cases can readily be 
evaluated by numerical methods based on the algo- 
rithms described in Sects. 3.3.2 and 3.3.3, below. The 
use of these algorithms on a personal computer with 
standard spreadsheet software is straightforward and 
yields numerical results that were found to be every- 
where finite except at distances \z\ <0.01 A from the 
aperture plane. 

The same algorithms can also used at distances 
z »A, and this is recommended because the standard 
analytical solutions of Fresnel's integral (2) are notori- 
ously difficult to use on a personal computer. For exam- 
ple, the algorithm defined by Eqs. (8a-c), below, is less 
tedious than implementing Lommel's analytical solu- 
tion of the Fresnel diffraction pattern of circular 
apertures in terms of infinite series of Bessel functions 
[2]. Likewise, the use of Eqs. (lla,b) is easier than the 
computation of complex Fresnel integrals by Taylor 
series and polynomial approximations [14]. The 
algorithms (8a-c) and (1 la,b) have the added advantage 
of being rigorously accurate, and it should also be 
noted that in most cases it is not necessary to compute 
the reverse field. On the other hand, the methods 
described in this paper can only be used for normally 
incident light. It should also be mentioned that, owing 
to the highly structured nature of the diffraction 
patterns, the use of any of these methods can be cum- 
bersome when wide apertures are considered. 



3.3.2 Circular Apertures 
3.3.2.1 Algorithms 

Let ABCB'A' be the rim of a circular aperture of 
radius w which is illuminated by normally incident 
coherent light and is centered on the coordinate origin, 
as shown in Fig. 2. As the corresponding diffraction 
pattern must be rotationally symmetrical about the 
z-axis it will be sufficient to consider its variation in the 
xz-plane, and thus the point of observation is chosen as 
P = (x, 0, z). The integrals (5a,b) may then be reduced 
to single integrals by defining the area elements dQ 
so that they are all concentric with the projection 
Q = (x, 0, 0) of P onto the aperture plane and coincide 
with the circles QBQ^B' shown in the figure, where 
Q ^ = (^, 0, 0) is the right-most point at which these 
circles intersect the x-axis. Under these conditions the 
phases kQP will be constant and equal to 



p=kQP=kQsP=k^(Z-x) 2 +z 2 =Jv 2 +(kz) 2 , 
v =*(€-*) 



(7a) 



everywhere on these area elements and the integration 
can be carried out over B,-x alone. As also indicated in 
the figure, these area elements are in general not fully 
contained in the aperture and must therefore be evalu- 
ated as 



dQ = 2nd(Z-x)(Z-x)(l- X /n) 
= {2nlk 2 )Avvi\-xlTt), 



(7b) 



where 2% is the angle subtended by the obstructed arc 
BQfB' and is given by 



COS£ = 



w 2 -x 2 -g-x) 2 
2x(£-x) 

_(kw) 2 -(kx) 2 -v 2 
2kxv 



(7c) 



or r\ = or jt, as appropriate, when the right-hand side 
of (7c) exceeds ± 1. Hence one finds, using Eqs. (5a,b) 
and (7a,b), 



«£(*,*) = -£jdQ^ 



(7d) 



= -iJdvvO-*/*)™^ z>0, 
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Fig. 2. Notation used for circular apertures. 



lA: dz 



=-*zJdvv(l -*/*)(! --) 



1 . exp(i^) 



(7e) 



z>0, 



so that the integrals defined in Eq. (6e) are now given by 
u K (x,z) = jdvv(l-x/7t)C(P), z>0, 



P p /r 

-i[(j+^)cos/J--p-sin/J], 



d K (x,z) = jdvv(l-x/7t)C(P), z>0, 

P P P 

" i[( }~^ )C ° S ^ + ^ Sin ^ L 



(7f) 



(7g) 



The limits of these integrals are v = to k(w + x) 
when x < w, and v = k(x - w) to k(x + w) when x > w. 
In the first of these ranges it is assumed that % = when 
v<k(x-w). 

In order to evaluate these integrals numerically, 
divide the aperture radius w into N equal elements, and 
let % — x = nwIN, x = mwlN. Hence, dv = kwIN, 
v = nkwIN, and therefore 

u K (x,z)=u K (m,z)=(^) 2 ^(n-±) (l- Xnm ) C{fi K ), 



z>0, 



■N 



'kw\ 



(8a) 



u K (x,z)=u K (m,z)=(— ) 2 ^(«-^) (1-X.J C(P„), 
z>0, " W 

where 



if n > m , 



^H,m — cos 



4 Af 2 -w 2 -(«-l) 2 



2m(«-j) 



(8c) 
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K 



(«■ 



\) (kwf 2 



(8d) 



all quantities being evaluated at the mid-points of the 
summation elements. The ranges of summation in Eqs. 
(8a) are 1 <n<m + N'\fn<m and m—N<n<m+N. 
The number N of summation elements used in these 
expressions must be large enough to ensure that the 
oscillations of e l/J in Eqs. (7d,e) are accurately sampled. 
According to Eq. (7a) and the quarter-wave criterion 
this will be achieved if the pathlength difference APQ 
between successive summation elements is less than 
XI 4, or A/5< nil. Differentiating Eq. (7a) with respect 
to v gives 



kw A j3Aj3 TtB 
— = Av = - < — — 

N v 2v 



N> 



2vkw 



(8e) 



u^(0,z)=exp(ikz)- 



z exp(ikW) 

W 



1 z 

"k (°> z ) = ex P(i fe ) " -z ex P(i kW ) - — ) , (9b) 

u K (0,z)-^exp(ikW)(\+p, 



which can now be substituted into Eqs. (6i,j) and then 
leads to the following expressions for the forward and 
reverse irradiances along the z-axis of Fig. 2, 



E(0,z) = E geom {l + -(l-p 2 

-cos[*(z-JF)](l-^)}, (9c) 
£(0,z)=l£ aeom O- J ^) 2 . 



which can now be used as follows to estimate the 
required value of N. In the immediate vicinity of the 
screen (z -> we have j3~ v, so that N > 2wl X= 100 
when w = 50A. In the Fresnel limit (z » X) one finds 
iV > 2 Vm^w/jtz where v^ is the largest value of v used 
in the computations, and therefore N>6w 1 /Xz= 150 
when w = 50X, z=100A and the farthest point of 
observation is located one aperture halfwidth beyond 
the shadow boundary (v m2x =1.5 kw). It should be noted 
that the corresponding values of N can be much larger for 
wider apertures. 

3.3.2.2 Numerical Examples 

1. Forward and Reverse Axial Irradiances. Equations 
(7f,g) can be solved in closed form for the special case 
of axial points of observation, where x = 0, % = and 



P=ky[t 



2 +z 2 



and therefore 



exp(i^) 



«^(0,z) = -i^ 2 Jd^- 

o ^ 

kW 

— — d t exp(t) =exp(i kz) -exp(i kW) , (a a ) 

kz 

t = ip, W = yj^+z 2 . 
Hence one finds, using Eqs. (7e) and (6e), 



These results are valid for arbitrary values of z and 
are plotted in Fig. 3, where the upper curve represents 
the forward axial irradiance E(0, z) computed from the 
first Eq. (9c) for w = 5 X and the lower curve represents 
the corresponding reverse irradiance E(0, z). Both 
quantities are continuously differentiable on crossing 
the aperture plane, the forward irradiance (a) being 
equal to the geometrical irradiance E geom in the negative 
Fresnel limit —z»X, oscillating rapidly in the vicinity 




-10 



zlw 



10 



Fig. 3. Forward and reverse axial irradiances, (a) E K and (b) E K , on 
opposite sides of a circular aperture of radius w = 5 A. 
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of the aperture, and tapering off beyond it. The reverse 
axial irradiance (b) is seemingly diverging from a vir- 
tual source point beyond z = - 10A, exhibits no oscilla- 
tions, and its magnitude is equal to the lower envelope 
of the forward irradiance curve. 

2. Near-Field Diffraction Patterns. The applica- 
tion of Eqs. (8a-c) for the computation of diffraction 
patterns is straightforward, and in this work standard 
spreadsheet software was used to obtain numerical 
results. As an example, Fig. 4 shows the near- field 
irradiance profiles (6i) on the opposite sides (z = ±X) of 



the previously considered circular aperture of radius 
w = 5 X. The resemblance of the central portions of these 
curves is remarkable and is attributable to the basic 
premise adopted in Sec. 3.2, where St. Venant's hypoth- 
esis was replaced with the assumption that diffraction is 
a continuous field phenomenon that occurs on both sides 
of the screen. The corresponding reverse profiles defined 
by Eq. (6j) are shown in Fig. 5, illustrating the onset of 
a reverse flow of energy on the positive side of the screen 
(z = X) as well as the fact that the reflected diffraction 
pattern on the negative side (z = -X) is simply a mirror 
image of the transmitted pattern in Fig. 4. 




-1.25 -0.75 -0.25 0.25 0.75 x/w 1.25 

Fig. 4. Forward irradiance profiles E K at distances z = ± X from a circular aperture of radius w = 5 X. 

1-5 i E/E, 




-1.25 -0.75 -0.25 0.25 0.75 x/w 1.25 

Fig. 5. Reverse irradiance profiles E K at distances z = ± X from a circular aperture of radius w = 5 X . 
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3. Aperture Field and Transmission Coefficients. For 

radiometric applications it is important to know the 
transmission coefficient of an aperture, defined as 
T = ^totai/ ^geom where 0,0,2! is the total radiant flux trans- 
mitted into the half space z > and O geom is the geo- 
metrical flux incident upon it in the absence of 
diffraction. Thus, for a circular aperture as discussed in 
this Section, 



T = - 



1 



AE, 



g eom Q5/ 



\dQE(Q)=—\dxx\u^(x,0)\' 



N- 



-^(m-|)|« K (m,0)| 2 , 



(10a) 



m=\ 



where A = nw 2 is the aperture area, dQ = 2kxAx is the 
circular area element, and u K (m, z) is given by 
Eq. (8a). The practical use of Eq. (10a) is tedious 
because it requires consecutive numerical integrations 
and also poses computational problems arising from the 
singularities of u K (x, z) in the limit z — » 0. The most 
troublesome singularities, due to the terms in 1 f/3 2 and 
1 /p 3 in Eq. (7f), can be avoided altogether by invoking 
the second Eq. (5e) so that 



u K (m, 0) : 



i[i+ M <f s W„ 



JL 



(10b) 



where u^ is singularin 1/j3, only, and z min is very small. 

Trial computations indicated that values of z min as small 
as 0.0003 X could be used without difficulty and that 
the limiting value of T defined by Eq. (10a) was 
reached at the 0.1% level for z min < 0.0003 X. 
Accordingly, the numerical result presented in the 
following were computed for z niin = 0.001 X. As expect- 
ed, the aperture irradiance distributions \u K (x, 0)| 2 
obtained from Eq. (10b) were similar to an average of 
the two curves in Fig. 5. They were everywhere finite 
and continuous and bore no similarity to the aperture 
distributions presumed by Rayleigh and Bouwkamp 
[6,10], Levine and Schwinger [11] or Wolf and 
Marchand [15]. 

Figure 6 shows the dependence of the transmission 
coefficient (10a) on aperture size for the range 
< kw < 3rc, and here it is seen that t exhibits a damped 
oscillatory behavior and quickly approaches the limit, 
T — » 1 as kw — » oo. It was estimated that this limit is 
reached within less than 1 % when w = 5 X. 



1.5 

T 

1 
0.5 



1 2 kwln 3 

Fig. 6. Transmission coefficient % of circular apertures vs aperture 
radius w. 



3.3.3 Apertures Bounded by Straight Edges 
3.3.3.1 Algorithms 

Consider a plane aperture of width (/ + r), bounded 
by parallel straight edges as indicated in Fig. 7. The 
corresponding diffraction pattern will consist of 
straight bands which are parallel to the edges, and thus 
it will again be sufficient to compute its variation along 
the x-axis. For a given point of observation P = (x, 0, z) 
and arbitrary aperture points Q = (|, r\, 0), Eqs. (5a,b) 
can now be expressed as follows [12], 



•'rs 



(X, Z) : 



lk_ 

In 
-i 

k(r-x) 



M 



d?7 exp[ik<J(% - x) 2 +r\ 2 + z 2 



M RS v*"> Z ) 



\ dvH<f>(j3), 

~-k(l+x) 

ifeVdvH^) 



(11a) 



-k(l+x) 



where z > 0, Hq W = J„ + iY„ denotes a Hankel function, 
J„ and Y„ are Bessel functions, and v as well as p are the 
same as in Eq. (7a). It follows at once that the forward 
and reverse wave functions defined in Sec. 3.2 are 
given by 
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O x Q, %-x , Q 



-►X 



Fig. 7. Notation used for slits. 



U K (x, z)=- [4s (x,z)+ U ( ^(X, z)] 



k(r-x) 



J* dvS(p), z>0, 



-k(l+x) 



1 

2 

t(r-x) 



«k <X z ) = i; ["rs } ( x > z ) - m rs( x ' z )l 



J dvS(j3), z>0, 



S(P)=^Vo(P)+jYi(P] 



(lib) 



(lie) 



To evaluate these expressions by numerical integra- 
tion define, in analogy to the definitions that precede 
Eqs. (8a), l+r = 2w, Av = kwIN, L = 2kl/Av, R = 2kr/Av, 
v = «Av, kx = mAv . Therefore, 

, R-m 

u K (x,z) = u K (m,z) = — ^ S(P„), z>0, (12a) 



kw 



n=—{L+m) 



u K (x,z) = u K (m,z)=— £ S(P n ), z>0,(12b) 



N 



n=—(L+m) 



where, as before, p n =<J[(n-±) 2 (kw) 2 / N 2 ] +(fe) 2 and 

the choice of N is governed by the same considerations 
as in Sec. 3.3.2.1. As P„ is independent of m it follows 
at once that, if u K (m, z) and u K (m, z) are known and m 
is replaced by m ± 1, the new values will be 

u K (m ± 1, z)=u K (m, z) + — [ S(P_ L+m± ,) - 5( /^_ m± ,)], 



z>0, 



(12c) 
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u K (m±l,z) = u K (m,z)+ — [S(P_ L+m± ^-S(P R _ m± ^], 



z>0, 



(12d) 



which illustrates in a very instructive manner how the 
diffraction pattern changes when the point of observa- 
tion is moved so that new portions of the wavefront are 
covered and uncovered by the aperture edges. The 
recursion formulae (12c,d) are convenient for practical 
applications as they allow the computation of succes- 
sive values without performing the summations of 
Eqs. (12a,b) for every point of observation. 

The results obtained for the diffraction profiles of 
slits are similar to those presented in Sec. 3.3.2.2 for 
circular apertures. They were included in Ref. [12] and 
are omitted here. 



of u K (x, z) and u K (x, z) spaced by Ax = 7t/5& = 0. 1 X. 
The starting values, w K (0, z) = 0.5 and k k (0, z) = 0, 
were obtained by performing the summations in 
Eqs. (13a,b) for m = Q. 

The corresponding expressions according to 
Sommerfeld's rigorous theory [7, 9, 12] for the border 
case of a normally incident geometric field are 



,(/v) 



Kg ' (x, z) = u s (x, z)±u s (x,z), z ^ , 

u s (x, z) = exp(i kz) V(p), p = 421 X [- sign (z) 

yjr + x + yjr - x], z ^ , 



(14a) 



(14b) 



ii s (x, z) = exp(-i kz) V(p), p = JlTI [" sign (z) ^ 
-Jr + x—ylr — x],z^0, 



3.4 Half-Plane Results and Comparison With 
Sommerfeld's Theory 

The aperture depicted in Fig. 7 is reduced to an in- 
finitesimally thin half plane defined by x > 0, z = 0, by 
letting L=-°°, R = 0. Accordingly, Eqs. (12a-d) are 
now replaced by 



« K (x, z) = J dv S(P) = u K (m, z) = Av £ S(P n ) , 
u K (m±l,z) = u K (m,z) + AvS(P m±l ), z>0, 



(13a) 



kx — m 

u K (x,z) = J dv S(P) = u K (m, z) = Av £ S{P n ) , 



(13b) 



« K (m±l,z) = u K (m,z) + Av S(p m±l ) , z >0 , 



where it should be noted that the last terms on the right- 
hand sides of Eqs. (12c,d) are now absent because 
there is no right aperture edge. Here, as above, kx = mAv, 
k(E,-x) = nAv, S(P„) and S(P„), being the same as in 

(llb,c), and P n = ^{n -\) 2 Av 2 +(kz) 2 . The summa- 

tion elements Av must again satisfy the quarter-wave 
criterion, so that the phase difference between succes- 
sive summation elements, A/J = /}„ + 1 - P n , must not 
exceed %I2. This phase difference has a maximum 
value, (A/J ) max = Av when z = 0, and hence it follows 
that the quarter-wave condition will always be satisfied 
when Av < Til 2. As the choice of Av also determines 
the step size of the recursions (13a,b) the value chosen 
in this work was Av = 7t/5 , yielding equidistant values 



V(p) = 



exp(-i7z74) 

~1T 



drexp( 



i7ZT 2 /4) =-[l+C(p) 



(14d) 



+ S(p)]--[C(p)-S(p)], 



where u^ ,s) (x,z) satisfy the wave equation and the re- 



spective boundary conditions duf'ldz =0 and Kg : 



=0, 



k s and u s are the forward and reverse wave functions, 
V(p) is the complex Fresnel integral the form used by 
Sommerfeld, 6 while C(p) and S(p) are the standard 
Fresnel cosine and sine integrals. 

During the course of this work, the above sets of 
equations were used for numerous computations 
intended to quantify the residual differences between 
them. The results obtained showed consistently that 
these differences were insignificant even in the imme- 
diate vicinity of the diffracting edge. As an example, 
Fig. 8 shows the respective forward irradiance distribu- 
tions, | u K | 2 and | u s | 2 , at the distances z = + X from the 
half plane. The similarity of the curves is unmistakable, 
and further computations showed that the same quanti- 
ties differ by less than ±0.01 for z = ±\0X. This 
demonstrates the rigor of the equations derived in 
Sec. 3.2 for the specific case of half-plane diffraction, 
and it may be inferred that the corresponding expressions 
for circular apertures and slits are equally as rigorous. 



Different notations have been used elsewhere. For example, 
in Ref. [17] therse relationships were expressed of the function 



F(-V7r72 p) =exp(i nl4)4nV{p) . 
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-2.5 







x/X 2.5 



Fig. 8. Forward irradiance profiles E K at distances 

z = ±X from a half plane according to Eqs. (13a) ( ) 

and (14b) ( ). 



4. A Note on Electromagnetic Diffraction 
Theories 

The thoughts presented in this paper arose from a 
realization that, at some time in the mid 1900's, the 
theory of diffraction had reached an impasse. Although 
it should have been obvious that the inability to account 
for polarization phenomena is an intrinsic feature of the 
scalar theory of light, and although there was no exper- 
imental evidence that this is a practical problem in the 
context of Fresnel's theory, 7 attempts were made to 
modify the theory so that, in a manner of speaking, 
polarization is introduced through the back door and 
the scalar theory assumes a pseudo-vectorial character. 
These modifications were justified as follows [e.g., 10, 
16]. 

For infinite plane screens and plane-parallel incident 
light, the Helmholtz integral theorem [5] has two mutu- 
ally independent solutions, m w and u {s \ which include 
but are not limited to the Rayleigh-Sommerfeld 

and Sommerfeld solutions 1% an ^ Kg discussed in 



Sec. 3 and obey the same boundary conditions as those 
pertaining to the reflection of polarized light at a perfect- 
ly conducting metallic reflector. Hence it is surmised, 
without further proof, that these solutions can be regarded 
as the components of mutually independent electromag- 
netic fields given by H (rt = Je7Ji(0, u (p) ,0), i m (p) = 
-Jji/e curlH (rt and E w =JjJe(0,u (s) ,0), i£H w = 
^e//J. curl E (s \ where SI units are used and ^jM sis 
the wave impedance of free space. 

It is easy to show that these assumptions are fallacious 
and can lead to contradictory or dubious results. For 
example: 

According to the above, unpolarized light incident 
on a diffracting screen is transmitted as partially polar- 
ized light having a polarization ratio given by 
n = Iw^l^lw^fand ordinarily this ratio would not 
be unity in the near zone. However, in the mid zone 
h w and u (s) are expected to be same so that II = 1 . 
This is absurd, because the polarization of light can- 
not change while propagating in free space. 

The curl operators in Maxwell's equations are known 
to introduce singularities into the "electromagnetic" 
theory, even when the underlying wave functions u ip) 
and w w are everywhere finite and continuous. These 
singularities have been interpreted as evanescent 
edge waves emitted by the diffracting edge, as origi- 
nally presumed by Thomas Young but soon refuted 
by Fresnel. 8 The notion of edge waves has persisted 
as a means to explain singularities encountered in 
diffraction calculations [18], whereas a thorough 
analysis might have revealed that these singularities 
are artifacts of mathematical errors or illogical 
assumptions. 

The reason for inconsistencies of this kind is that 
the conditions h w = and 3m w /3z = are necessary 
and sufficient to satisfy Helmholtz' theorem, but in- 
sufficient to transform the scalar theory of light into a 
viable tool for explaining electromagnetic phenomena. 



This may be due to the paraxial approximations made in Fresnel's 
theory. The existence of polarization effects at large angles is, of 
course, evidenced by the properties of diffraction gratings. 



It appears that Young did not publish this hypothesis and that 
Fresnel was the first to mention it. 
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